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Severe acute respiratory syndrome 
coronavirus 2 (SARS-CoV-2) evolu- 
tion is expected, given the nature of 
virus replication. Selection and 
establishment of variants in the 
human population depend on viral 
fitness and on molecular and im- 
munological selection pressures. 
Here we discuss how mechanisms 
of replication and recombination 
may contribute to the emergence 
of current and future variants of 
SARS-CoV-2. 


SARS-CoV-2 emerged in December 2019 
to cause a worldwide pandemic. Rapid 
sharing of information by researchers 
enabled the development of vaccines in 
record time. However, as SARS-CoV-2 
continues to spread, it is evolving. This 
evolution should come as no surprise 
since RNA viruses are known to mutate, 
given their error-prone replication process. 
As SARS-CoV-2 circulates, mounting 
selection pressures, such as natural 
infection- and vaccine-mediated immunity 
will influence the selection of viral variants. 
In this forum article we discuss how two pro- 
cesses — replication and recombination — 
contribute to the generation of current and 
future SARS-CoV-2 variants. 


Replication-Induced Genomic 
Errors 

Coronaviruses (CoVs), such as SARS- 
CoV-2, are RNA viruses that use an 
RNA-dependent RNA polymerase (RdRp) 
to generate genomic copies. The RdRp is 
intrinsically error-prone, thus promoting 


virus evolution through replication- 
associated changes (Figure 1). The muta- 
tion rate is kept low by the proofreading 
activity of a virus-encoded 3' exonuclease, 
such as nsp14 for SARS-CoV-2. Despite 
proofreading activity, SARS-CoV-2 accu- 
mulates genetic changes over time as 
evidenced by emerging variants. Under- 
standing the molecular factors that lead 
to the generation of SARS-CoV-2 vari- 
ants, along with extrinsic factors that 
lead to the selection and establishment 
of these variants, is critical to control the 
ongoing pandemic. The coronavirus dis- 
ease 2019 (COVID-19) pandemic has 
also reignited interest in studying wildlife 
reservoirs. Selection analyses suggest 
that most adaptive changes within 
SARS-CoV-2 genomes occurred in wild- 
life reservoirs, likely bats, prior to emerg- 
ing in human populations [1]. It would be 
highly valuable to assess the rate of accu- 
mulation of replication-induced errors in 
the genomes of SARS-CoV-2 or SARS- 
CoV-2-related viruses in other mamma- 
lian species, such as bats, to determine 
the true diversity of these viruses in 
wildlife reservoirs. It remains unknown if 
infected bats produce a more diverse 
quasi-variant population of sarbecoviruses 
compared with other permissive mamma- 
lian species. 


The biochemical make-up of different cell 
types from different hosts may impact CoV 
replication and rate of error. For example, 
the catalytic reaction of viral polymerases 
requires certain substrates, the quality and 
quantity of which may differ between cell 
types or cells from alternative hosts [2]. 
Host factors, including enzymes, can also 
impact the evolution of viral variants. Inher- 
ent differences in the coding sequence of 
host enzymes, such as TMPRSS2, become 
more prominent between distantly related 
mammalian species. When a zoonotic 
virus such as SARS-CoV-2 spills over into 
anew mammalian species, the virus inter- 
acts with a different ortholog of TMPRSS2. 
Although the spike protein may be 
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sufficiently cleaved in this new host to facili- 
tate cellular entry, over time the virus popu- 
lation is likely to evolve and select for 
variants with higher cleavage efficiency 
with the TMPRSS2 ortholog in the new an- 
imal species [3]. Indeed, furin cleavage sites 
in SARS-CoV-2-related CoVs (SARS2r- 
CoVs), discovered in bats, differ in their 
sequence from human isolates of SARS- 
CoV-2 [4], suggesting possible species- 
specific adaptation in host protease utiliza- 
tion by the virus. 


Accumulation of genomes with replication- 
associated random mutations within the 
same cell has the potential to generate a 
heterogeneous mix of viral proteins, includ- 
ing spike protein, which brings into ques- 
tion the true diversity of the spike on the 
surface of individual virions and subse- 
quent tissue-level spread of SARS-CoV-2 
and its variants. If a more orderly molecular 
process to regulate the homogeneous dis- 
tribution of spike protein variants on individ- 
ual virions exists, it remains to be identified. 
Regular RNA sequencing analyses are un- 
able to identify these differences since 
they can only be captured by ultra-high- 
resolution single-virion protein sequencing, 
the technology for which is in its infancy. 


The SARS-CoV-2 genomic mutation rate in 
humans is estimated at 0.8-2.38 x 10° 
nucleotide substitutions per site per year 
largely based on analysis of sequencing 
data archived in public repositories [5]. 
Emerging studies are attempting to confirm 
this mutation rate using experimental inves- 
tigations [6]. By comparison, mutations 
rates for influenza A virus and Middle East 
respiratory syndrome CoV (MERS-CoV) 
are 2.3 x 10° and 1.12 x 10° nucleotide 
substitutions per site per year, respectively 
[7-10]. Human CoVs (HCoV)-OC43 and 
HCoV-229E have an average mutation 
rate of 3-6 x 10% nucleotide substitutions 
per site per year [8]. Emerging experimental 
data suggest that SARS-CoV-2 is capable 
of mutating and accumulating changes 
when facing a new cell type, albeit in the 
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Figure 1. Replication-Associated Generation of Severe Acute Respiratory Syndrome Coronavirus 2 
(SARS-CoV-2) Variants. Replication-associated mutations and intrinsic selection factors play a critical role in 
the evolution and subsequent emergence of SARS-CoV-2 variants from infected cells. SARS-CoV-2 genome 
[positive (+) sense single stranded (ss)RNA] is released into the cytoplasm after virus binding to the angiotensin- 
converting enzyme 2 (ACE2) receptor and entry (A; red strand). The viral replicase complex transcribes the 
genome into negative (-) sense ssRNA (B; green strand) which acts as a template for subsequent synthesis of 
full-length +sense ssRNA genome copies (C). During replication, errors introduce mutations within the SARS- 
CoV-2 genome (D), which lead to mutations within viral proteins (E). While one copy of the genome is packaged 
into each SARS-CoV-2 virion (F), the heterogeneity of spike proteins on the surface of each virion (represented 
by different colors) remains unknown. Finally, SARS-CoV-2 variants carrying +sense ssRNA genomes with a 
varying number of mutations are released from the infected cell following egress (G). 


absence of immune surveillance in a single- 
cell-type infection model [6]. A clinical study 
also reported the rapid evolution of SARS- 
CoV-2 variants in the presence of antibod- 
ies from convalescent plasma therapy [11]. 
While random errors during replication may 
induce genetic mutations in SARS-CoV-2, 
multiple extrinsic factors, such as individual 
and population-level immunity, play a vital 
role in the selection of these variants. 
More research is warranted to fully under- 
stand the cellular and molecular drivers 
of genomic mutation and selection in 
SARS-CoV-2. As SARS-CoV-2 accumu- 
lates new genetic changes, we shall need 
to reassess the mutation rate to better 
understand the contribution of replication- 
associated random mutations and its im- 
pact on SARS-CoV-2 transmission and 
emergence of new variants. 


Recombination-Associated 
Mutations 

While replication-associated mutations 
generate small changes in the viral 
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genome, recombination may introduce 
more important modifications, leading to 
dramatic changes in the phenotype of 
SARS-CoV-2 (Figure 2). Discontinuous 
transcription of CoV genomes enables 
recombination in a cell coinfected with 
more than one CoV species or variant 
via ‘strand switching’ by the viral RdRp 
(Figure 2A) [12]. This process leads to the 
production of chimeric subgenomic RNA 
and proteins. Chimeric proteins may have 
consequences for the fate of infected 
cells and for cellular and tissue tropism of 
progeny virions. As technology evolves, 
we might be able to track the spread of 
individual virions, along with deciphering 
the heterogeneity of their composition. 


SARS-CoV-2 genomes can also undergo 
homologous recombination, in which 
RNA molecules recombine via regions of 
high similarity (Figure 2B). Recombination 
events are common for beta-CoVs and 
have been observed between SARS- 
related CoVs in wild-caught bats [13,14]. 
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Data suggest that recombination within 
HCoVs, including SARS-CoV-2, does 
occur [15]. Recently, eight potential re- 
combination events among SARS-CoV-2 
genomes have been identified. Four of 
eight were noted within structural genes, 
while two of eight were noted within the 
spike gene. It is important to note that 
these recombination events were not 
found across all subsampled datasets 
[15], and the frequency of SARS-CoV-2 
recombination and factors that influence 
the rate of recombination still remain 
elusive. It is vital to monitor SARS-CoV-2 
recombination as part of ongoing surveil- 
lance and diagnostic efforts, especially 
among circulating variants and also with 
other beta-CoVs. These studies will be 
crucial to assess the evolutionary trajec- 
tory of SARS-CoV-2 within the ongoing 
pandemic, along with predicting the long- 
term efficacy of vaccine- and natural 
infection-mediated immunity. 


The Need to Investigate the 
Molecular Determinants of 
SARS-CoV-2 Variant Emergence 
Tissue and cell tropism, and receptor dis- 
tribution, are important determinants of 
coinfection by two closely related CoVs, 
and hence, the probability of recombina- 
tion. However, little is known about other 
viral and host factors that are required to 
permit or facilitate CoV recombination. 
Much work remains to be done to fully 
characterize the replication machinery of 
SARS-CoV-2 and the fidelity of its RdRp 
in different species. Understanding the 
replication process will be critical to deter- 
mine if SARS-CoV-2 is more prone to 
replication-induced errors and/or recom- 
bination, relative to other highly patho- 
genic CoVs, such as SARS-CoV and 
MERS-CoV. 


The ongoing COVID-19 pandemic has 
thrust CoV research to the forefront and 
we are making tremendous progress in 
understanding coronavirus pathogenesis 
in general. While the focus so far has 
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Figure 2. Recombination-Induced Variants. (A) Strand switching. Discontinuous transcription facilitates the generation of potential chimeric subgenomic RNA 
(sgmRNA) in cells that are infected with more than one species or variant of Coronavirus (CoV). A CoV (CoV 1) entering (a) an already-infected cell will release its 
genomic RNA in the cytoplasm (b), which will undergo discontinuous transcription (c) to generate sgmRNA (d). During discontinuous transcription, the viral RNA- 
dependent RNA polymerase (RdRp) may inadvertently switch strands and jump (e) onto a template from a second CoV (CoV 2), leading to the synthesis of chimeric 
sgmRNA (CoV1+CoV2) (f). (B) Homologous recombination. Similar to recombination between homologous DNA sequences, regions of homology between two CoV 
genomes (CoV1 and CoV2) may facilitate breakage and rejoining of genomes to form chimeric genomes (CoV1+CoV 2). 


largely been on developing vaccines 
and therapeutics against SARS-CoV-2, 
as effective vaccines are rolling out, re- 
searchers are focusing on the biology 
of SARS-CoV-2. Many questions about 
SARS-CoV-2 biology still remain — do 
some host or selected cell types favor 
recombination over others? Do individual 
virions carry multimeric or a heterologous 
set of spike variants, and if so, how does 
that affect virus spread and multiorgan 
dispersion? More research is certainly 
warranted to identify molecular hotspots 
and catalysts of SARS-CoV-2 replication- 
or recombination-induced mutants and 
the pathophysiological consequences of 
infection with these mutants. 
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